Locality Sensitive Hashing, Jaccard Similarity, Duplicate Detection, Document Clustering

Performance Hints for BigQuery
trmlabs.com·5h·Discuss: Hacker News
🚀Query Optimization
Natural language processing for word sense disambiguation and information extraction
arxiv.org·15h·Discuss: r/compsci
📥Feed Aggregation
What Deep Learning Theory Teaches Us About AI Memory
dev.to·1d·Discuss: DEV
🧠Learned Compression
The power of box dimension attacks on the Epstein files
news.ycombinator.com·14h·Discuss: Hacker News
🔄Burrows-Wheeler
T3X.ORG nmhbasic/index
t3x.org·10h·Discuss: Hacker News
📺VT100
TRUNAJOD: A text complexity library for text analysis built on spaCy — TRUNAJOD 0.1.1 documentation
trunajod20.readthedocs.io·9h
📝Parsing Grammars
Optimizing Bracha's Reliable Broadcast: Shaving Rounds off a 37-Year-Old Algorithm
blog.can.ac·3d
🤝Paxos Consensus
Soft Filtering: Guiding Zero-shot Composed Image Retrieval with Prescriptive and Proscriptive Constraints
arxiv.org·2d
🧮Vector Embeddings
wwes4/AI_Accel_1.5x: AI acceleration framework for ~1.5x speedups in mid-sized models via tension-based pruning. Built utilizing xAI's Grok.
github.com·1d·Discuss: Hacker News
📊Quantization
Document search using Claude and an inverted index.
annanay.dev·4d·Discuss: Hacker News
🗂️Vector Databases
MCAP Indexing — Monday Morning Haskell
mmhaskell.com·5d
🔄Burrows-Wheeler
Why MAP and MRR Fail for Search Ranking (and What to Use Instead)
towardsdatascience.com·2d
📊Search Ranking
Redis Threading Model: Debunking the Single-Threaded Myth
redis.io·3d·Discuss: DEV
Redis Internals
Climate Monitoring Search Engine: Multi-Vectors in Qdrant
pub.towardsai.net·4d
🗂️Vector Search
Real Time Detection and Quantitative Analysis of Spurious Forgetting in Continual Learning
arxiv.org·2d
💻Local LLMs
The Little Book of Python Anti-Patterns — Python Anti-Patterns documentation
docs.quantifiedcode.com·2d·Discuss: Hacker News
Format Verification
nix-community/lila: Nix hash collection software, to aggregate build reports from several builders [maintainer=@JulienMalka, @raboof]
github.com·3d
❄️Nix Flakes
Abstract: The escalating sophistication of malware necessitates advanced detection techniques beyond signature-based or heuristic approaches. We introduc...
freederia.com·3d
🦠Malware Analysis
3 Smart Ways to Encode Categorical Features for Machine Learning - MachineLearningMastery.com
machinelearningmastery.com·5d
🧠Machine Learning